(57) Abstract: Disclosed are compounds of formula (I), in which D is a dye
selected from a cyanine dye or a derivative thereof; B is an affinity tag; F
comprises a target bonding group selected from a carboxylic acid thioester
group and a 1,2-aminothiol group; M is a group adapted for attaching to F; and
L1 and L2 each independently comprise a group containing from 1 to 40 linked
atoms selected from carbon atoms which may optionally include one or more
groups selected from -NR'-, -O-, -CH=CH-, -CO-NH- and phenylenyl groups, where
R' is selected from hydrogen and C1 - C4 alkyl. The invention also relates to
methods that afford direct attachment of the cyanine dye reporter group to
either the N-terminus or C-terminus of a synthetic or recombinant peptide or
protein, and their derivatives, in a site-specific manner, coupled with
purification of the resultant labelled molecule.

Site-specific Labelling of Proteins using Cyanine Dye Reporters

The present invention relates to reagents and methods for site-specific
labelling of proteins using cyanine dyes as reporter molecules. In particular,
the invention relates to new cyanine dye derivatives containing thioester
activated groups and groups reactive with target molecules containing or
derivatised to contain a thioester reactive moiety.



There is increasing interest in, and demand for, fluorescent reporters
for use in the labelling and detection of biomolecules. Cyanine and related
dyes such as rigidised cyanine dyes and squaraines offer a number of
advantages over other fluorescent dye reagents and they are finding
widespread use as fluorescent labels in such diverse areas as sequencing,
microarrays, flow cytometry and proteomics. For example, US 5569587
(Waggoner et al) discloses water soluble cyanine dye derivatives that possess
reactive groups suitable for reaction with target molecules that contain, or are
derivatised to contain, -OH, -NH2, or -SH groups. The cyanine dyes are
characterised by having very high extinction coefficients and favourable
quantum yields. In addition, cyanine dyes possess good photostability and
are not readily photobleached.



In many applications there is a need to form a permanent link, in the
form of a covalent bond, between a fluorescent labelling dye and a target
molecule such as a protein. The chemistry of peptide and protein labelling is
well documented and a wide range of labelling reagents is now commercially
available. For a review and examples of protein labelling using fluorescent
labelling reagents, see "Non-Radioactive Labelling, a Practical Introduction",
Garman, A.J., Academic Press, 1997; "Handbook of Fluorescent Probes and
Research Chemicals", Haugland, R.P., Molecular Probes Inc., 1992.

Site-specific incorporation of a fluorescent label into a protein or
peptide may be of considerable benefit in certain biochemical and biophysical
studies, for example fluorescence resonance energy transfer, and protein
structure and function studies. One method for the site-specific attachment of
a fluorescent label into a target polypeptide utilises the native chemical
ligation reaction. According to this procedure, an unprotected peptide
fragment containing an N-terminal cysteine residue and a second unprotected
peptide fragment containing an α-thioester group are chemoselectively ligated
together at physiological pH, irrespective of their primary sequences, to
generate an amide bond at the ligation site. For examples, see reviews by
Cotton, G.J. and Muir, T.W., Chem. Biol., (1999), 6, R247-260; Giriat, I., Muir,
T.W. and Perler, F.B., Genetic Engineering, (2001), 23, 171-199; Muir, T.W.,
Syn. Lett., (2001), 6, 733-740.

Tolbert, T.J. and Wong, C-H. (Angew. Chem. Int. Ed., (2002), 41, 2171-2174)
describe the preparation of fluorescein and biotin thioester derivatives
and the reaction of these with N-terminal cysteine-containing recombinant
proteins. Schuler, B. and Pannell, L.K. (Bioconjugate Chemistry, published on
line, 18 July 2002) reported the preparation of a benzyl thioester of Cy5™ and
subsequent reaction with a synthetic polypeptide containing an N-terminal
cysteine residue.

However, there are no reports describing thioester derivatives of
cyanine dyes in which the reporter is also linked covalently to an affinity tag.
Use of such a reagent in reactions involving site-specific labelling of proteins
and peptides will be advantageous for subsequent separation and purification
of the fluorescent dye-labelled target. The present invention therefore
provides new cyanine dye reagents and methods that afford direct attachment
of the cyanine dye reporter to either the N-terminus or C-terminus of a
synthetic or recombinant peptide or protein and their derivatives, in a
site-specific manner, coupled with purification of the resultant labelled
molecule.
According to one aspect of the present invention, there is provided a
compound comprising a cyanine dye or derivative thereof containing at least
one target bonding group selected from a carboxylic acid thioester group or a
group suitable for covalent reaction with a thioester, characterised in that said
compound includes an affinity tag covalently bound thereto.

Suitably, the compound is of formula (I):

D-L1-M-L2-B
     |
     F

(I)
wherein:

D is a dye selected from a cyanine dye or a derivative thereof;
B is an affinity tag;
F comprises a target bonding group selected from a carboxylic acid thioester
group and a 1,2-aminothiol group;
M is a group adapted for attaching to F; and
L1 and L2 each independently comprise a group containing from 1-40 linked
atoms selected from carbon atoms which may optionally include one or more
groups selected from -NR'-, -O-, -CH=CH-, -CO-NH- and phenylenyl
groups, where R' is selected from hydrogen and C1 - C4 alkyl.

Suitably, there are 2 to 30 atoms in each of L1 and L2, preferably 6 to
20 atoms.

Preferably, L1 and L2 are independently selected from the group:

-{(CHR')p-Q-(CHR')r}s-

where Q is selected from: -CHR'-, -NR'-, -O-, -CH=CH-, -Ar- and
-CO-NH-; R' is hydrogen or C1 - C4 alkyl, p is 0 - 5, r is 1 - 5 and s is 1 or 2.

Particularly preferred Q is selected from: -CHR'-, -O- and -CO-NH-,
where R' is hereinbefore defined.
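
The linker pattern above, together with the stated atom-count ranges (1-40, suitably 2 to 30, preferably 6 to 20), can be sketched as a small calculation. This is only an illustrative sketch: the number of backbone atoms assigned to each Q group is an assumption made here for counting purposes and is not stated in the text, and -Ar- is left out because its contribution depends on the substitution pattern. The function names are hypothetical.

```python
# Backbone atoms contributed by each Q option. These counts are an
# illustrative assumption, not values given in the text.
Q_BACKBONE_ATOMS = {
    "-CHR'-": 1,
    "-NR'-": 1,
    "-O-": 1,
    "-CH=CH-": 2,
    "-CO-NH-": 2,
}


def linker_length(p: int, q: str, r: int, s: int) -> int:
    """Count linked backbone atoms in -{(CHR')p-Q-(CHR')r}s-."""
    if not 0 <= p <= 5:
        raise ValueError("p must be 0 to 5")
    if not 1 <= r <= 5:
        raise ValueError("r must be 1 to 5")
    if s not in (1, 2):
        raise ValueError("s must be 1 or 2")
    return s * (p + Q_BACKBONE_ATOMS[q] + r)


def classify(n_atoms: int) -> str:
    """Place an atom count in the narrowest range quoted in the text."""
    if 6 <= n_atoms <= 20:
        return "preferred"
    if 2 <= n_atoms <= 30:
        return "suitable"
    if 1 <= n_atoms <= 40:
        return "within formula (I)"
    return "outside formula (I)"


if __name__ == "__main__":
    n = linker_length(p=2, q="-CO-NH-", r=5, s=2)
    print(n, classify(n))
```

Under these counting assumptions, a doubled amide-containing unit with p = 2 and r = 5 falls inside the preferred 6 to 20 atom window.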



In one embodiment L2 is a cleavable linker and may additionally include
group P which may be suitably selected from a chemically-cleavable group,
an enzyme-cleavable group, or a photochemically-cleavable group. Suitable
chemically cleavable groups include carbamate esters and carboxylate esters,
which are both cleaved under basic conditions. Suitable enzyme cleavable
groups may be selected from groups such as ester, amide and phospho-diester
groups. Such groups are substrates for, and are hydrolysed by,
hydrolases, such as proteases, esterases and phospho-diesterases. Suitable
photocleavable groups P for use in the compound of formula (I) may contain
the 4,5-dialkoxy-2-nitrobenzyl alcohol linker (Holmes, C.P., and Jones, D.G.,
J.Org.Chem., (1995), 60, 2318-2319) or phenacyl linkers (Wang, S.,
J.Org.Chem., (1976), 41, 3258-3261). These groups undergo efficient
photoreaction upon 300 nm illumination, resulting in the rapid cleavage of the
dye molecule or dye-labelled protein from the affinity tag.

Suitably, the group M may be any suitable functional group adapted for
attaching the target bonding group F. Preferably, M is selected from:

>N- and >CR'-

wherein R' is hereinbefore defined.

Suitable affinity tags may be selected from biotin, desthiobiotin and
metal chelating ligands such as His-tag and iminodiacetic acid, nitrilotriacetic
acid and the like. Preferred affinity tags are selected from biotin and
desthiobiotin.

In one embodiment of the present invention, the target bonding group F
is a carboxylic acid thioester of formula:

-L3-CO-S-R''

wherein L3 is a bond or is a group containing from 1-30 linked atoms
selected from carbon atoms and optionally one or more groups selected from
-NH-, -O- and -CO-NH-; and R'' is C1 - C4 alkyl, C6 - C10 aryl, or C7 - C15
aralkyl, which may be optionally substituted with sulphonate; or is the group
-(CH2)2-CONH2. In the case where L3 is a bond, the target bonding group F
is attached directly to group M.

In an alternative embodiment, the target bonding group F is a
1,2-aminothiol group of formula:



wherein L3 is hereinbefore defined.

Thus, the present invention provides fluorescent labelling reagents
comprising a cyanine dye or derivative thereof, that are modified by
incorporating a target bonding group and an affinity tag into the molecule.
The target bonding group may be selected from a carboxylic acid thioester
group or a 1,2-aminothiol group. Where the target bonding group is a
thioester group, it is selectively reactive with a 1,2-aminothiol group on a
target molecule, suitably a protein or peptide, or a derivative thereof. In the
alternative, the cyanine dye may contain a 1,2-aminothiol group for reaction
with a thioester group on the target. The incorporation of a reactive thioester
or, alternatively, a 1,2-aminothiol functionality into the chemical structure of
the reporter molecule enables the target molecule to be directly labelled in a
convenient one step process. According to the methods of the invention,
labelling of peptides and proteins is site-specific, irrespective of the
composition of the primary sequence. By generating the target primary
sequence with either an N-terminal cysteine or a thioester functionality,
site-specific labelling can be achieved directly, by incubating the target with
the appropriate derivative of the cyanine dye, suitably, the thioester and
1,2-aminothiol derivatives respectively. Furthermore, inclusion of an affinity
tag in the labelling reagent allows subsequent purification of the fluorescent
dye labelled protein or peptide.

WO 2004/011556    PCT/GB2003/003196

Suitably, the cyanine dye or cyanine dye derivative may be selected
from cyanine dyes, rigidised cyanine dyes and squaraine dyes, provided that
the dye incorporates at least one carboxylic acid thioester group, or a group
suitable for covalent reaction with a thioester. Table 1 shows some examples
of cyanine dyes, having particular excitation (Abs) and emission (Em)
characteristics.

Table 1

Dye      Fluorescence Colour    Abs (nm)    Em (nm)
Cy2      Green                  489         506
Cy3      Orange                 550         570
Cy3.5    Scarlet                581         596
Cy5      Far red                649         670
Cy5.5    Near-IR                675         694
Cy7      Near-IR                743         767
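
As a convenience for choosing a dye against a given excitation source, the values of Table 1 can be held in a small lookup. The sketch below only restates the table; the helper names and the 15 nm matching tolerance are illustrative assumptions, not part of the disclosure.

```python
# Dye -> (fluorescence colour, absorption max in nm, emission max in nm),
# transcribed from Table 1.
CYANINE_DYES = {
    "Cy2": ("Green", 489, 506),
    "Cy3": ("Orange", 550, 570),
    "Cy3.5": ("Scarlet", 581, 596),
    "Cy5": ("Far red", 649, 670),
    "Cy5.5": ("Near-IR", 675, 694),
    "Cy7": ("Near-IR", 743, 767),
}


def stokes_shift(dye: str) -> int:
    """Difference between emission and absorption maxima, in nm."""
    _, absorption, emission = CYANINE_DYES[dye]
    return emission - absorption


def dyes_excitable_near(wavelength_nm: float, tolerance_nm: float = 15.0):
    """Dyes whose absorption maximum lies within the tolerance (assumed value)."""
    return [
        name
        for name, (_, absorption, _) in CYANINE_DYES.items()
        if abs(absorption - wavelength_nm) <= tolerance_nm
    ]


if __name__ == "__main__":
    for name in CYANINE_DYES:
        print(name, stokes_shift(name))
    print(dyes_excitable_near(650))
```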



In one embodiment according to the first aspect, the compound has the
formula (II):



wherein:
groups R3 and R4 are attached to the Z1 ring structure and groups R5 and R6
are attached to the Z2 ring structure;
n is an integer from 1 to 3;
Z1 and Z2 independently represent the atoms necessary to complete one ring
or two fused ring aromatic or heteroaromatic systems, each ring having five or
six atoms selected from carbon atoms and optionally no more than two atoms
selected from oxygen, nitrogen and sulphur;
X and Y are the same or different and are selected from: >CR8R9, oxygen,
sulphur, -CH=CH-, >N-W wherein N is nitrogen and W is selected from
hydrogen and the group R10;

at least one of groups R1, R2, R3, R4, R5, R6, R8, R9 and R10 is the group:

-L1-M-L2-B
    |
    F

where B, F, M, L1 and L2 are hereinbefore defined;
groups R7 are independently selected from hydrogen and C1 - C4 alkyl which
may be unsubstituted or substituted with aryl, or two or more of R7 together
with the group:



form a hydrocarbon ring system substituted with R7 and which may optionally
contain a heteroatom selected from -S- or >NR7, wherein R7 and n are
hereinbefore defined;

remaining groups R3, R4, R5 and R6 are independently selected from the
group consisting of hydrogen, halogen, amide, cyano, nitro, mono- or di-C1 -
C6 alkyl-substituted amino, carbonyl, carboxyl, C1 - C6 alkyl, C1 - C6 alkoxy,
aryl, heteroaryl, aralkyl and the group -(CH2)m-Y where Y is selected from
sulphonate, sulphate, phosphonate, phosphate and quaternary ammonium
and m is zero or an integer from 1 to 6;
remaining groups R8, R9 and R10 are independently C1 - C6 alkyl; and
remaining groups R1 and R2 are independently selected from hydrogen, C1 -
C10 alkyl, the group -(CH2)m-Y wherein Y and m are hereinbefore defined,
and benzyl which may be unsubstituted or substituted by up to two nitro
groups.

In a second embodiment according to the first aspect, the compound
has the formula (III):



(III)

wherein 

groups R^^, R^^, R^* and R^® are attached to the rings containing X and Y or,
optionally, are attached to atoms of the Z1 and Z2 ring structures;
Z1 and Z2 independently represent the atoms necessary to complete one ring
or two fused ring aromatic or heteroaromatic systems, each ring having five or
six atoms selected from carbon atoms and optionally no more than two atoms
selected from oxygen, nitrogen and sulphur;
X and Y are the same or different and are selected from: >CR8R9, oxygen,
sulphur, -CH=CH-, >N-W wherein N is nitrogen and W is selected from
hydrogen and the group R10;

A is selected from O and NR^^ where R^^ is the substituted amino radical: 

— N 

30 ^R« 



at least one of groups R», R^. R^° F}\ R«, R^^ R^*. R'^ R" and R" is the 
group: 
-L1-M-L2-B
    |
    F

where B, F, M, L1 and L2 are hereinbefore defined;

remaining groups R^^, R^^, R" and R^® are independently selected from
the group consisting of hydrogen, halogen, amide, cyano, nitro, amino, mono-
or di-C1 - C6 alkyl-substituted amino, carbonyl, carboxyl, C1 - C6 alkyl, C1 - C6
alkoxy, aryl, heteroaryl, aralkyl and the group -(CH2)m-Y where Y is selected
from sulphonate, sulphate, phosphonate, phosphate and quaternary
ammonium and m is zero or an integer from 1 to 6;
remaining groups R8, R9 and R10 are independently C1 - C6 alkyl;
remaining group R^^ is selected from hydrogen, C1 - C4 alkyl and aryl; and
remaining group R^® is selected from C1 - C6 alkyl, aryl, heteroaryl, an acyl
radical having from 2-7 carbon atoms, and a thiocarbamoyl radical.

Suitably, in the compounds according to formula (II) and (III), Z1 and Z2
may be selected independently from the group consisting of phenyl, pyridinyl,
naphthyl, anthranyl, indenyl, fluorenyl, quinolinyl, indolyl, benzothiophenyl,
benzofuranyl and benzimidazolyl moieties. Additional one or two fused ring
systems will be readily apparent to the skilled person. Preferably, Z1 and Z2
are selected from the group consisting of phenyl, pyridinyl, naphthyl, quinolinyl
and indolyl moieties. Particularly preferred Z1 and Z2 are phenyl and naphthyl
moieties.

Suitably, at least one of the groups R of the compounds of formula (II)
and (III) is a water solubilising group for conferring a hydrophilic characteristic
to the compound. Solubilising groups, for example, sulphonate, sulphonic
acid and quaternary ammonium, may be attached directly to the aromatic ring
structures Z1 and/or Z2 of the compounds of formula (II) and (III).
Alternatively, solubilising groups may be attached by means of a C1 to C6 alkyl
linker chain to said aromatic ring structures and may be selected from the
group -(CH2)m-Y where Y is selected from sulphonate, sulphate,






phosphonate, phosphate, quaternary ammonium and carboxyl; and m is
hereinbefore defined. Alternative solubilising groups may be carbohydrate
residues, for example, monosaccharides, or polyethylene glycol derivatives.
Examples of water solubilising constituents include C1 - C6 alkyl sulphonates,
such as -(CH2)3-SO3- and -(CH2)4-SO3-. However, one or more sulphonate
or sulphonic acid groups attached directly to the aromatic ring structures of a
dye of formula (II) or (III) are particularly preferred. Water solubility may be
advantageous when labelling proteins.

In one embodiment the compound of formula (I) is a fluorescent
reporter molecule. In this embodiment, none of the substituent groups R in
the compounds of formula (II) and (III) contains a nitro group.

In another embodiment, the compound of formula (I) is a non-fluorescent
or substantially non-fluorescent dye wherein at least one of the groups R
attached to the aromatic ring structures of the compounds of formula (II) and
(III) comprises at least one nitro group. In this embodiment, suitably, the at
least one nitro group may be attached directly to the Z1 and/or Z2 ring
structures. In the alternative, a mono- or di-nitro-substituted benzyl group
may be attached to the Z1 and/or Z2 ring structures which optionally may be
further substituted with one or more nitro groups. The non-fluorescent or
substantially non-fluorescent cyanine dye or cyanine dye derivatives
according to the invention may be used to label one component of a
fluorescent donor/acceptor pair in assays involving the detection of binding
and/or cleavage events in reactions involving biological molecules, as
described in EP 1086179 B1 (Amersham Biosciences UK Limited).

In the embodiments according to the first aspect:
i) Aryl is an aromatic substituent containing one or two fused aromatic
rings containing 6 to 10 carbon atoms, for example phenyl or naphthyl, the
aryl being optionally and independently substituted by one or more
substituents, for example halogen, straight or branched chain alkyl groups
containing 1 to 10 carbon atoms, aralkyl and alkoxy, for example methoxy,
ethoxy, propoxy and n-butoxy;
ii) Heteroaryl is a mono- or bicyclic 5 to 10 membered aromatic ring
system containing at least one and no more than 3 heteroatoms which may be
selected from N, O, and S and is optionally and independently substituted by
one or more substituents, for example halogen, straight or branched chain
alkyl groups containing 1 to 10 carbon atoms, aralkyl and alkoxy, for example
methoxy, ethoxy, propoxy and n-butoxy;
iii) Aralkyl is a C1 - C6 alkyl group substituted by an aryl or heteroaryl
group;
iv) Halogen and halo groups are selected from fluorine, chlorine, bromine
and iodine.



By virtue of the target bonding group F, the compounds according to
the present invention are useful for covalently labelling target biological
materials in a site-specific manner for applications in biological detection
systems. Suitable target materials include proteins, post-translationally
modified proteins, peptides, antibodies, antigens, and protein-nucleic acids
(PNAs). The reporter moiety may also be conjugated to species which can
direct the path of the reporter within, or aid entry to or exit from, cells (live or
dead); such as, for example, long alkyl residues to allow permeation of
lipophilic membranes, or intercalating species to localise a reporter in a
nucleus or other cellular enclave containing double-stranded DNA.

In a second aspect, there is provided a method for labelling a protein of
interest wherein said protein contains or is derivatised to contain an
N-terminal cysteine, the method comprising:

i) adding to a liquid containing said protein a compound of formula (I):

D-L1-M-L2-B
     |
     F

(I)

wherein:

D is a dye selected from a cyanine dye or a derivative thereof;
B is an affinity tag;
F comprises a target bonding group selected from a carboxylic acid thioester
group and a 1,2-aminothiol group;
M is a group adapted for attaching to F; and
L1 and L2 each independently comprise a group containing from 1-40 linked
atoms selected from carbon atoms which may optionally include one or more
groups selected from -NR'-, -O-, -CH=CH-, -CO-NH- and phenylenyl
groups, where R' is selected from hydrogen and C1 - C4 alkyl; and

ii) incubating said compound with said protein under conditions suitable
for labelling said protein.

Suitably, there are 2 to 30 atoms in each of L1 and L2, preferably 6 to
20 atoms.

Preferably, L1 and L2 are independently selected from the group:

-{(CHR')p-Q-(CHR')r}s-

where Q is selected from: -CHR'-, -NR'-, -O-, -CH=CH-, -Ar- and
-CO-NH-; R' is hydrogen or C1 - C4 alkyl, p is 0 - 5, r is 1 - 5 and s is 1 or 2.

Particularly preferred Q is selected from: -CHR'-, -O- and -CO-NH-,
where R' is hereinbefore defined.

Preferred compounds of formula (I) for use in labelling a target protein
are those having formula (II) or (III) as hereinbefore defined.

Covalent labelling using compounds of the present invention may be
accomplished with a target having at least one carboxylic acid thioester group
or 1,2-aminothiol group as hereinbefore defined. The target may be incubated
with an amount of a compound of the present invention having at least one
group F as hereinbefore defined that can covalently bind with the
complementary group of the target material. The target material and the
compound of the present invention are incubated under conditions and for a
period of time sufficient to permit the target material to covalently bond to the
compound of the present invention. Thus, for example, the thioester group F
may be reacted and form a covalent bond with any of the above target
materials that contains, or has been derivatised to contain, a 1,2-aminothiol
group. These methods and the products resulting from them, for example,
reporter-labelled biomolecules, are envisaged as further aspects of the
invention.

Suitably, the protein of interest may be selected from the group
consisting of antibody, antigen, protein, peptide, microbial materials, cells and
cell membranes.



In a particular embodiment according to the second aspect, there is
provided a method of separating and/or purifying the dye-labelled protein of
interest by affinity chromatography utilising the affinity of the affinity tag moiety
for an immobilised ligand (or specific binding partner) attached to a support
material. Affinity chromatography provides a quick and convenient method to
enable the separation of labelled and unlabelled protein molecules under
physiological conditions. Proteins labelled with an affinity tag can be
selectively bound to an affinity column and any unreacted protein removed by
washing the column. Suitable specific binding moieties include avidin or
streptavidin (for a biotin tag); immobilised metal ions, for example, Cu(II),
Ni(II), Fe(II) and Fe(III) (for His-tag or iminodiacetic acid). Methods for affinity
purification of proteins will be well known to the skilled person, see for
example Ostrove, S., Methods in Enzymology, (1990), Vol 182, page 357.



In a typical labelling procedure, a target peptide or protein containing
an N-terminal cysteine residue is agitated with an excess of a cyanine dye
thioester derivative, e.g. Cy5-MESNA (Cy5-mercaptoethanesulphonic acid
ester), in phosphate buffer (typically 200 mM NaCl, 200 mM sodium
phosphate) at ~pH 7.3 - 7.4 containing ~1.5% MESNA. The concentration of
the target polypeptide in the labelling reaction is generally between 100 µM and
10 mM, whilst the Cy5-MESNA is generally present in excess, for example 1.5
to 3-fold molar excess. When the target polypeptide concentration is relatively
low the concentration of Cy5-MESNA is usually maintained at or above 1 mM.
Generally, for labelling small peptides a solution of Cy5-MESNA and MESNA
cofactor is directly added to the lyophilised target.
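
The concentration guidance in the preceding paragraph (1.5 to 3-fold molar excess of dye, with the dye kept at or above 1 mM for dilute targets) can be expressed as a short calculation. This is a minimal sketch under those stated figures; the function name and the default 2-fold excess are illustrative assumptions.

```python
def dye_concentration_mM(target_mM: float,
                         fold_excess: float = 2.0,
                         floor_mM: float = 1.0) -> float:
    """Suggested Cy5-MESNA concentration for a given target concentration.

    Applies the 1.5 to 3-fold molar excess quoted in the text and keeps the
    dye at or above the 1 mM floor used when the target is dilute.
    """
    if target_mM <= 0:
        raise ValueError("target concentration must be positive")
    if not 1.5 <= fold_excess <= 3.0:
        raise ValueError("fold excess outside the 1.5 to 3-fold range")
    return max(target_mM * fold_excess, floor_mM)


if __name__ == "__main__":
    # Dilute target (100 uM): the 1 mM floor applies.
    print(dye_concentration_mM(0.1))
    # Concentrated target (2 mM) at 1.5-fold excess.
    print(dye_concentration_mM(2.0, fold_excess=1.5))
```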

Typically, for site-specific labelling of proteins and large polypeptides
using the reagents of the present invention, the target is first exchanged into
an appropriate buffer, which is known not to affect the labelling reaction. An
equal volume of a solution of Cy5-MESNA and MESNA thiol cofactor in
ligation buffer is then added to the protein to give the desired final
concentration of the reactants. The reaction mixture is agitated overnight at
room temperature. The reaction time may be lowered to less than one hour
for high reactant concentrations or, if the stability of the target polypeptide is
an issue, the labelling reaction may be performed efficiently at 4°C. On
completion of the labelling reaction, dithiothreitol (DTT) is added to a final
concentration of ~50 mM and the desired material isolated by affinity
chromatography.
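
The DTT quench step above amounts to a simple dilution calculation: adding a volume V_add of stock at concentration C_stock to a reaction of volume V_rxn gives a final concentration C_final = C_stock * V_add / (V_rxn + V_add). The sketch below solves this for V_add. The 500 mM stock and 450 µl reaction volume in the example are assumed values chosen for illustration, and the function name is hypothetical.

```python
def dtt_volume_to_add_ul(reaction_volume_ul: float,
                         stock_mM: float,
                         final_mM: float = 50.0) -> float:
    """Volume of DTT stock needed to reach the target final concentration.

    Solves final = stock * v_add / (v_rxn + v_add) for v_add.
    """
    if final_mM <= 0 or stock_mM <= final_mM:
        raise ValueError("stock must be more concentrated than the final target")
    return final_mM * reaction_volume_ul / (stock_mM - final_mM)


if __name__ == "__main__":
    # Assumed example: 450 ul reaction quenched with a 500 mM DTT stock.
    print(dtt_volume_to_add_ul(450.0, stock_mM=500.0))
```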

Various different denaturants, organic solvents and detergents may be
added to the reaction buffer when performing native chemical ligation and
expressed protein ligation reactions, to aid the ligation of the peptide
fragments and/or stabilise the reactants or products. Such reagents may be
utilised in the labelling reaction to increase product yield if necessary.
Examples include, but are not limited to, guanidinium chloride, urea,
dimethylformamide, dimethylsulphoxide, acetonitrile, Triton X-100, octyl
glucoside, 1,6-hexanediol and glycerol.

The ligation reaction using the derivatised cyanine dye according to the
present invention may be optimally performed at between pH 7.0 and pH 8.0
and at temperatures varying between 4°C and 37°C. It is envisaged that
such a range of conditions is compatible with the site-specific labelling reaction
described herein.

The advantage of the present method is that it enables the introduction
of an extrinsic label into a proteinaceous substrate in a regioselective and
specific manner, thus minimising any detrimental effects that labelling may
have on the biological function of the protein. Controlling the
stoichiometry of labelling is important where dye overload may interfere with
biological activity. In addition, if this controlled labelling stoichiometry is
directed towards a single terminal site, rather than towards an internal site,
this may have the benefit of further maintaining the biological viability of the
labelled species.

The invention is further illustrated by reference to the following
examples and figure in which:

Figure 1 illustrates the products from the labelling reaction of an N-terminal
cysteine derivative of the Grb2 SH2 domain with the thioester derivative,
α-D-desthiobiotin-ε-Cy5-L-lysine-MESNA according to Examples 3 and 4.

Experimental 

1. 2-[(1E,3E,5E)-5-(3,3-Dimethyl-1-{6-oxo-6-[(2-sulphoethyl)thio]hexyl}-5-
sulfo-1,3-dihydro-2H-indol-2-ylidene)penta-1,3-dienyl]-1-ethyl-3,3-dimethyl-5-
sulfo-3H-indolium

To Cy™5 mono acid (47 mg, 0.062 mmol) in a solution of
7-azabenzotriazolyloxytris(pyrrolidino)phosphonium hexafluorophosphate
(PyAOP, 66 mg, 0.127 mmol) in anhydrous dimethylformamide (DMF, 1 ml) was
added anhydrous di-isopropylethylamine (DIEA) (30 µl, 0.1724 mmol) and
mixed for 5 minutes. The activated dye solution was then added to a stirred
solution of 2-mercaptoethanesulphonic acid, sodium salt (MESNA, 40 mg,
0.243 mmol) in DMF (2 ml) and DIEA (30 µl, 0.1724 mmol) under a dry nitrogen
atmosphere. To this mixture was added, as a solid, dried 4Å molecular
sieves (~1 g, <5 micron, activated powder). The mixture was stirred under a dry
nitrogen atmosphere, at room temperature, in the dark overnight. Thin layer
chromatography analysis (reverse phase C18 plates, eluents
water/acetonitrile (70:30, containing 0.1% TFA)) indicated a major component
(Rf(thioester) = 0.25) with no trace of starting material (Rf(acid) = 0.12).

The molecular sieves were removed by filtration and the filtrate was added
dropwise into an excess of ethyl acetate. The blue solid was filtered off and
was purified by reverse phase-high performance liquid chromatography
(RP-HPLC): [Phenomenex Prodigy C18 column; 15%B-30%B over 30 mins @ 20
ml/min; eluent A = 0.1% TFA/water, eluent B = 0.1% TFA/MeCN, UV detection
at 650 nm]. The product was isolated as a dark blue/purple solid (40 mg,
0.0513 mmol, 83% yield).
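
The quoted yield can be checked from the masses given. The sketch below is illustrative only: it assumes the Cy5 mono acid (0.062 mmol) is the limiting reagent and uses the mass of 781 reported for the product as an approximate molar mass; the function names are hypothetical.

```python
def millimoles(mass_mg: float, molar_mass_g_per_mol: float) -> float:
    """Convert a mass in mg to mmol (mg divided by g/mol gives mmol)."""
    return mass_mg / molar_mass_g_per_mol


def percent_yield(product_mmol: float, limiting_mmol: float) -> float:
    """Isolated yield relative to the limiting reagent, in percent."""
    return 100.0 * product_mmol / limiting_mmol


if __name__ == "__main__":
    product = millimoles(40.0, 781.0)      # assumed molar mass of the thioester
    print(round(product, 4))                # mmol of isolated product
    print(round(percent_yield(product, 0.062)))  # percent, vs Cy5 mono acid
```

With these inputs the calculation gives about 0.051 mmol and a yield that rounds to 83%, in line with the figures reported in the text.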

Accurate mono-isotopic mass: C35H45O10N2S4 requires 781. Found
MALDI-TOF. LC-MS found mass: M+ 781.25. δH (300 MHz, d6-DMSO): 8.37 (t,
1H), 8.36 (t, 1H), 7.83 (d, 1H), 7.82 (d, 1H), 7.67 (dd, 1H), 7.64 (dd, 1H), 7.36
(d, 1H), 7.33 (d, 1H), 6.61 (t, 1H), 6.38 (d, 1H), 6.28 (d, 1H), 4.15 (m, 2H),
4.08 (t, 2H), 3.06 (m, 2H), 2.63 (m, 2H), 2.56 (t, 2H), 1.64 (m, 2H), 1.28 (t, 3H,
7.1), 1.40 (m, 2H). λmax (abs) = 647 nm (ε (H2O) = 230,000 M-1 cm-1).
2. Determination of Specificity of Labelling using 2-[(1E,3E,5E)-5-(3,3-
dimethyl-1-{6-oxo-6-[(2-sulphoethyl)thio]hexyl}-5-sulfo-1,3-dihydro-2H-indol-2-
ylidene)penta-1,3-dienyl]-1-ethyl-3,3-dimethyl-5-sulfo-3H-indolium

2.1 Preparation of Cy5-Cys-Gly-Leu-Asp-Lys-Arg-Gly-Cys-Gly-NH2

i) Synthesis of H-Cys(Trt)-Gly-Leu-Asp(OtBu)-Lys(Boc)-Arg(Pmc)-Gly-
Cys(Trt)-Gly-rink amide resin

H-Cys(Trt)-Gly-Leu-Asp(OtBu)-Lys(Boc)-Arg(Pmc)-Gly-Cys(Trt)-Gly-
rink amide resin was synthesised using a commercially available Applied
Biosystems Model 433A automated peptide synthesiser using FastMoc™
chemistry, following the instrument manufacturer's recommended procedures
throughout. The peptide was synthesised on a 0.25 millimolar scale
employing O-(benzotriazol-1-yl)-1,1,3,3-tetramethyluronium
hexafluorophosphate (HBTU) as the activating agent.

ii) H-Cys-Gly-Leu-Asp-Lys-Arg-Gly-Cys-Gly-NH2

H-Cys(Trt)-Gly-Leu-Asp(OtBu)-Lys(Boc)-Arg(Pmc)-Gly-Cys(Trt)-Gly-rink
amide resin (100 mg, theoretical loading 0.36 mmol/g) was deprotected and
cleaved from solid phase in 95% trifluoroacetic acid (TFA) / 2.5%
tri-isopropylsilane (TIS) / 2.5% water (3 ml) at room temperature for 2 hours.
The crude product was precipitated into a 10-fold excess of cold diethyl ether,
centrifuged at 2500 rpm for 5 minutes and the ether decanted off. The crude
peptide was washed twice more with ether and was purified by reverse phase-
high performance liquid chromatography (RP-HPLC) [Phenomenex Jupiter
C18 column, eluent A: 0.1% TFA/water, eluent B: 0.1% TFA/acetonitrile,
gradient: 0-73% B over 30 mins @ 1 ml/min, detection at 214 nm]. The product
was isolated and lyophilised to afford a colourless fluffy solid (21 mg by weight,
60%). Mono-isotopic mass: 906.4. Found mass (LC-MS): MH+ @ 907.3;
M+Na @ 929.6; > 95% pure as judged by RP-HPLC @ 214 nm (Phenomenex
Jupiter C18 column, eluent A: 0.1% TFA/water, eluent B: 0.1% TFA/acetonitrile,
5-50% B over 25 mins @ 1 ml/min, UV detection at 650 nm).
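
The LC-MS assignments above can be cross-checked by computing the expected m/z of the protonated and sodiated ions from the quoted mono-isotopic mass. This is a minimal sketch for singly charged adducts; the adduct mass constants are standard values supplied here, not taken from the text, and the function name is hypothetical.

```python
# Mass added by each singly charged adduct, in Da (standard values:
# proton, and sodium atom minus one electron).
ADDUCT_MASS = {
    "H": 1.007276,
    "Na": 22.98922,
}


def adduct_mz(monoisotopic_mass: float, adduct: str) -> float:
    """Expected m/z of a singly charged [M+adduct]+ ion."""
    return monoisotopic_mass + ADDUCT_MASS[adduct]


if __name__ == "__main__":
    peptide_mass = 906.4  # mono-isotopic mass quoted for the free peptide
    observed = {"H": 907.3, "Na": 929.6}
    for adduct, found in observed.items():
        expected = adduct_mz(peptide_mass, adduct)
        print(adduct, round(expected, 2), round(found - expected, 2))
```

The printed deviation for each adduct shows how far the found mass lies from the calculated value.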



iii) Cy5-Cys-Gly-Leu-Asp-Lys-Arg-Gly-Cys-Gly-NH2

To solid H-Cys-Gly-Leu-Asp-Lys-Arg-Gly-Cys-Gly-NH2 (3.0 mg by
weight, 0.0033 mmol) was added a solution of Cy5-MESNA (3.5 mg,
0.0045 mmol) in 200 mM phosphate buffer, 200 mM NaCl pH 7.2 containing
1.5% 2-mercaptoethanesulphonic acid, sodium salt (400 µl). The reaction
mixture was stirred on rollers for 30 minutes at room temperature in darkness.
During incubation, a blue precipitate formed, which re-dissolved on addition of
acetonitrile (40 µl).

500 mM DTT (200 µl) in 200 mM phosphate buffer, 200 mM NaCl pH 7.2
(0.5 ml, 0.0025 mmol) was then added to the reaction mixture, with complete
mixing, and was stirred for a further 30 minutes at room temperature in the
dark. The crude reaction mixture was then purified by RP-HPLC
[Phenomenex Jupiter C18 column, eluent A: 0.1% TFA/water, eluent B:
0.1% TFA/acetonitrile, gradient: 20-35% B over 30 mins at 4 ml/min, detection
at 650 nm and 214 nm]. The product was isolated and lyophilised as a blue
fluffy solid (1.6 mg by UV/VIS at 650 nm; 50% yield; 98% pure as judged by
RP-HPLC at 650 nm). Mono-isotopic mass C67H101N16O18S4 requires
1545.636. Found (LC-MS) M+ 1545.7.
2.2 Characterisation of Labelled Peptide

i) Ellman's Test on Cy5-Cys-Gly-Leu-Asp-Lys-Arg-Gly-Cys-Gly-NH2

A sample of Cy5-Cys-Gly-Leu-Asp-Lys-Arg-Gly-Cys-Gly-NH2 was
dissolved in 100mM sodium phosphate buffer; 1mM EDTA pH 7.27 (stock
buffer) to afford a O.S^M peptide stock by UV/VIS at 650nm.

O.S^M peptide stock (40µl) and 10mM 5,5'-dithiobis(2-nitrobenzoic acid)
(DTNB) in 100mM sodium phosphate buffer; 1mM EDTA pH 7.27 (50µl) were
mixed together in stock buffer (910µl) to afford a green solution. The
absorbance at 412nm (due to generation of TNB2-) was recorded against a
DTNB blank [10mM DTNB stock (50µl) in stock buffer (950µl)]. Using the
known molar absorption coefficient of TNB2- (14150 M-1 cm-1), the thiol
concentration was determined as 655µM, approximately twice the peptide
concentration, confirming two free thiol groups.
[SH] = [A412nm(sample) - A412nm(reference)] / ε(TNB2-)
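
The thiol calculation above is a direct application of the Beer-Lambert
law. The sketch below is illustrative only: the absorbance readings are
hypothetical values chosen for the example, a 1 cm path length is assumed,
and the function name is not from the source. The 25-fold factor follows
from the 40µl of peptide stock in a 1000µl assay volume described above.

```python
EPSILON_TNB = 14150.0  # M^-1 cm^-1, molar absorption coefficient quoted in the text


def thiol_concentration(a_sample, a_reference, path_cm=1.0, dilution=1.0):
    """Return thiol concentration in mol/l from blank-corrected A412 (Beer-Lambert)."""
    in_cuvette = (a_sample - a_reference) / (EPSILON_TNB * path_cm)
    return in_cuvette * dilution


# Hypothetical absorbance readings, not taken from the text.
a_sample, a_reference = 0.45, 0.08
cuvette = thiol_concentration(a_sample, a_reference)
# 40 ul of peptide stock in 1000 ul total corresponds to a 25-fold dilution.
stock = thiol_concentration(a_sample, a_reference, dilution=1000 / 40)
print(f"thiol in cuvette: {cuvette * 1e6:.1f} uM")
print(f"thiol referred to stock: {stock * 1e6:.1f} uM")
```

Dividing the stock thiol concentration by the peptide stock concentration
then gives the number of free thiols per peptide, which is the comparison
made in the text.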



ii) Enzyme Digestion of Cy5-Cys-Gly-Leu-Asp-Lys-Arg-Gly-Cys-Gly-NH2

To a solution of Cy5-Cys-Gly-Leu-Asp-Lys-Arg-Gly-Cys-Gly-NH2
(180µg by UV/VIS at 650nm) in TRIS buffer pH 8.0 (100µl) containing 10%
acetonitrile was added Asp-N (2µg) in TRIS buffer pH 8.0 (70µl). The reaction
mixture was stirred at room temperature in the dark for 4 hours. The reaction
mixture was treated with 250mM tris(2-carboxyethyl)phosphine, HCl (TCEP)
in TRIS buffer pH 8.0 (55µl) for 30 minutes. The reduced reaction mixture
was then diluted 1:5 with 0.1%TFA in water and purified by reverse phase
HPLC [Phenomenex Jupiter C18, eluent A: 0.1%TFA/water, eluent B:
0.1%TFA/acetonitrile, 5-50% B over 30mins @ 1ml/min, UV at 214nm,
650nm]. The two components of the reaction mixture were identified as:
Cy5-Cys-Gly-Leu-OH, mono-isotopic mass: C44H60N5O11S3 requires 930.3451,
found mass (MALDI-Tof): M+ 930.0; and H-Asp-Lys-Arg-Gly-Cys-Gly-NH2,
mono-isotopic mass: C23H43N11O8S requires 633.3016, found mass
(MALDI-Tof): M+ 633.0.
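
The "requires" values quoted for the two digestion fragments can be
checked by summing monoisotopic element masses over each molecular formula.
The following is a minimal sketch; the function name and the small parser
are illustrative, and only the five elements that occur in these formulae
are tabulated.

```python
import re

# Monoisotopic masses (u) of the most abundant isotope of each element.
MONOISOTOPIC = {
    "C": 12.0,
    "H": 1.00782503,
    "N": 14.00307401,
    "O": 15.99491462,
    "S": 31.97207117,
}


def monoisotopic_mass(formula):
    """Sum monoisotopic masses for a simple formula such as 'C23H43N11O8S'."""
    total = 0.0
    for element, count in re.findall(r"([A-Z][a-z]?)(\d*)", formula):
        total += MONOISOTOPIC[element] * (int(count) if count else 1)
    return total


# Formulae as quoted in the text for the two Asp-N fragments.
for formula in ("C44H60N5O11S3", "C23H43N11O8S"):
    print(formula, round(monoisotopic_mass(formula), 4))
```

The sketch handles flat formulae only (no brackets or charges), which is
sufficient for the formulae quoted in these examples.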

3. Preparation of α-D-Desthiobiotin-ε-Cy5-L-lysine-MESNA
[N6-(6-{(2Z)-2-[(2E,4E)-5-(1-ethyl-3,3-dimethyl-5-sulfo-3H-indolium-2-yl)penta-2,4-
dienylidene]-3,3-dimethyl-5-sulfo-2,3-dihydro-1H-indol-1-yl}hexanoyl)-N2-(6-
{[6-(5-methyl-2-oxoimidazolidin-4-yl)hexanoyl]amino}hexanoyl)lysylthio
ethane-2-sulfonic acid]




3.1 Preparation of α-Fmoc-ε-Cy5-L-lysine-OH
[2-[(1E,3E)-5-(1-{6-[(5-carboxy-5-{[(9H-fluoren-9-ylmethoxy)carbonyl]amino}pentyl)amino]-6-
oxohexyl}-3,3-dimethyl-5-sulfo-1,3-dihydro-2H-indol-2-ylidene)-1,3-
pentadienyl]-1-ethyl-3,3-dimethyl-5-sulfo-3H-indolium salt]

Cy5 mono free acid potassium salt (Amersham Pharmacia Biotech Ltd)
(450mg, 0.65mmol) and DIEA (720µl) were dissolved in anhydrous
dimethylsulphoxide (18ml). To this was added O-(N-succinimidyl)-N,N,N',N'-
bis(tetramethylene)-uronium hexafluorophosphate (666mg, 1.6mmol) and the
reaction mixture stirred at room temperature for 1 hr, after which time
negligible starting material remained by TLC (RP C18, 1:1 methanol:water).
The reaction mixture was slowly poured into diethyl ether to precipitate the
product, Cy5 mono NHS ester, which was filtered off, washed with ethyl
acetate and dried in vacuo. The product was re-dissolved in anhydrous
dimethylsulphoxide (18ml) and DIEA (720µl) added. Fmoc-lysine-OH
(360mg, 0.98mmol) was suspended in a mixture of phosphate buffer (pH 7.4)
(9ml) and dimethylsulphoxide (9ml). The suspension was slowly added to the
solution of Cy5 NHS ester. The reaction mixture was stirred at room
temperature for 12 hours. TLC (RP C18, 2:3 methanol:water) showed the
disappearance of starting material and the formation of a new product spot.

The product was purified by HPLC [Dynamax C18 column (50 x
4.14cm); flow rate 25ml/min; gradient of 20 to 80% B over 80 mins (eluent A =
0.1% TFA in water and eluent B = 0.1% TFA in acetonitrile); detection at
650nm]. The fractions containing the desired product were pooled and most of
the solvent removed under reduced pressure; the residue was freeze dried.
The product, α-Fmoc-ε-Cy5-L-lysine-OH, was obtained as a fluffy cyan solid
(487mg, 74%). MS (MALDI TOF) found 1008 (M+); [theoretical
(C54H64N4O11S2) 1009]. 1H NMR (200 MHz D6DMSO) 1.27 (t, 3H, CH3), 1.35
(m, 4H, CH2, CH2), 1.55 (m's, 4H, CH2, CH2), 1.7 (s, 12H, (CH3)2), 1.78 (m,
2H, CH2), 2.05 (t, 2H, CH2), 3.0 (m, 2H, CH2), 3.92 (m, 1H, CH amino acid),
4.11 (m, 4H, N-CH2, N+CH2), 4.27 (m, 3H, O-CH2, CH fluorenyl), 6.3 (d, 2H, α,
α' methine), 6.59 (t, 1H, γ methine), 7.28-7.48 (m's, 6H, Fmoc and indole Ar),
7.65 (d, 2H, fluorenyl Ar), 7.73 (d, 2H, fluorenyl Ar), 7.85 (s, 2H, indole Ar), 7.9
(d, 2H, indole Ar), 8.38 (t, 2H, β, β' methine).
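
The 74% yield quoted for 3.1 follows from the isolated mass, the amount of
the limiting reagent (0.65mmol of Cy5 acid, against 0.98mmol of
Fmoc-lysine-OH) and the theoretical product mass of 1009. A minimal sketch
of that arithmetic is given below; the function name is illustrative, and
treating the Cy5 acid as the limiting reagent is inferred from the
quantities stated above.

```python
def percent_yield(product_mg, limiting_mmol, product_mw):
    """Percent yield from isolated mass, limiting reagent amount and product molar mass."""
    theoretical_mg = limiting_mmol * product_mw  # mmol x g/mol = mg
    return 100.0 * product_mg / theoretical_mg


# 3.1: 487 mg isolated from 0.65 mmol Cy5 acid, product mass taken as 1009.
print(round(percent_yield(487, 0.65, 1009)))
```

The same calculation can be applied to the other preparations in this
section, provided the molar mass used matches the form in which the
product was isolated.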

3.2 Preparation of ε-Cy5-L-lysine-OH
[N6-(6-{(2Z)-2-[(2E,4E)-5-(1-ethyl-3,3-dimethyl-5-sulfo-3H-indolium-2-yl)penta-2,4-
dienylidene]-3,3-dimethyl-5-sulfo-2,3-dihydro-1H-indol-1-yl}hexanoyl)lysine]

α-Fmoc-ε-Cy5-L-lysine-OH (100mg, 0.1mmol) was deprotected in a
mixture of 20% piperidine in NMP (2ml). TLC (RP C18; 1:1 MeOH:water)
showed the formation of a new product spot, rf = 0.92 as compared to that of
the starting material, rf = 0.46. The piperidine was removed under reduced
pressure and the dye precipitated by pouring the reaction mixture into diethyl
ether. The product was filtered off and washed with dichloromethane and
then ethyl acetate to remove the yellow Fmoc derived by-product. The
product was dissolved in water, filtered and then purified by HPLC [Vydac
Protein and Peptide C18 column; 0-50%B over 45mins at 10ml/min; eluent A
= 0.1% TFA/water, eluent B = 0.1% TFA/MeCN, detection at 215nm].
Fractions containing the desired product were combined and the solvents
removed under reduced pressure to leave a blue residue. The residue was
triturated with ethyl acetate and the resultant solid dried under vacuum at
40°C. The product, ε-Cy5-L-lysine-OH, was obtained as a dark blue solid
(43mg, 48%). Analytical HPLC AKTA analysis; Phenomenex C18 column;
0-50%B over 30mins at 1ml/min; eluent A = 0.1% TFA/water, eluent B = 0.1%
TFA/MeCN, detection at 650nm; rt = 20.22mins. MS (MALDI TOF) found 785
(M+); [theoretical (C39H63N4O9S2) 785]. 1H NMR (300 MHz D6DMSO) 1.26 (t,
3H, CH3), 1.52 (m, 4H, CH2, CH2), 1.62 (m, 4H, CH2, CH2), 1.69 (s, 12H,
(CH3)2), 2.02 (m, 2H, CH2), 2.92 (m, 2H, CH2), 3.85 (m, 1H, CH amino acid),
4.10 (m, NCH2, N+CH2), 6.29 (d, 1H, α methine), 6.34 (d, 1H, α' methine), 6.58
(t, 1H, γ methine), 7.32 (m, 2H, indole Ar), 7.82 (d, 2H, indole Ar), 8.04 (m, 3H,
NH3+), 8.37 (t, 2H, β, β' methine).

3.3 Preparation of D-Desthiobiotinamidocaproic acid

D-Desthiobiotin (250mg, 1.17mmol) was dissolved in anhydrous
dimethylsulphoxide (2ml). To this solution was added PyAOP (610mg,
1.17mmol) and DIEA (200µl, 1.15mmol). The mixture was stirred under
nitrogen at RT for 3hrs before adding 6-aminocaproic acid (153mg, 1.17mmol)
and a further amount of DIEA (200µl, 1.15mmol). The reaction mixture was
stirred for a further 4hrs. TLC (RP C18; 1:2 MeOH:water; detection by
cinnamaldehyde staining) showed the formation of a new product spot, rf =
0.63, as compared to the starting material, rf = 0.76. The reaction mixture
was poured into excess diethyl ether to give a brown oil. The oil was triturated
with ethyl acetate until an off-white solid was obtained. The product was
filtered off and purified by HPLC [Vydac Protein and Peptide C18 column;
0-50%B over 30mins at 10ml/min; eluent A = 0.1% TFA/water, eluent B = 0.1%
TFA/MeCN, detection at 215nm]. Fractions containing the desired product
were pooled and the solvents removed under reduced pressure. The residue
was triturated with ethyl acetate to give a white solid. The product was filtered
off and dried under reduced pressure at 40°C. The product,
D-desthiobiotinamidocaproic acid, was obtained as a white solid (48mg, 12.5%).
MS (MALDI TOF) found 327 (M+); [theoretical (C16H29N3O4) 327]. 1H NMR
(300 MHz D6DMSO) 0.96 (d, 3H, CH3), 1.25 (m, 6H, CH2, CH2, CH2), 1.34 (m,
4H, CH2, CH2), 1.48 (m, 4H, CH2, CH2), 2.03 (m, 2H, C(O)CH2), 2.60 (m, 2H,
C(O)CH2), 3.01 (m, 2H, NHCH2), 3.47 (m, 1H, CH), 3.60 (m, 1H, CH), 6.11 (s,
1H, NH), 6.29 (s, 1H, NH), 7.71 (s, 1H, NH).

3.4 Preparation of D-Desthiobiotinamidocaproate N-hydroxy succinimidyl
ester



D-Desthiobiotinamidocaproic acid (48mg, 0.147mmol) was dissolved in
DMF (1ml) and N,N,N',N'-bis(tetramethylene)-O-(N-succinimidyl)uronium
hexafluorophosphate (HSPyU) (90mg, 0.21mmol) and DIEA (40µl, 0.23mmol)
were added. The reaction mixture was stirred under nitrogen at RT for 6hrs.
TLC (RP C18; 1:2 MeOH:water; materials detected by cinnamaldehyde
staining) showed the formation of a new product at the base line as compared
to the starting material, rf = 0.68. The reaction mixture was poured into diethyl
ether to give a brown gum. The supernatant was decanted off and the gum
again treated with diethyl ether. No solid formed. The gum was dried under
reduced pressure and the product, D-desthiobiotinamidocaproate N-hydroxy
succinimidyl ester, was used directly in the next dye coupling reaction,
assuming a theoretical yield of 62mg.

3.5 Preparation of α-D-Desthiobiotin-ε-Cy5-L-lysine-OH
[N6-(6-{(2Z)-2-[(2E,4E)-5-(1-ethyl-3,3-dimethyl-5-sulfo-3H-indolium-2-yl)penta-2,4-
dienylidene]-3,3-dimethyl-5-sulfo-2,3-dihydro-1H-indol-1-yl}hexanoyl)-N2-(6-
{[6-(5-methyl-2-oxoimidazolidin-4-yl)hexanoyl]amino}hexanoyl)lysine]



ε-Cy5-L-lysine-OH (43mg, 0.048mmol), D-desthiobiotinamidocaproate
N-hydroxy succinimidyl ester (62mg, 0.146mmol) and DIEA (80µl, 0.45mmol)
were stirred together in DMF (2ml) for 3hrs. TLC (RP C18; 1:1 MeOH:water)
showed the formation of a new product spot, rf = 0.79, just under that of the
starting material. The product was precipitated into diethyl ether (200ml) and
then filtered off. The material was purified in multiple runs by HPLC [Vydac
Protein and Peptide C18 column; eluent A = 0.1% TFA/water, eluent B = 0.1%
TFA/MeCN, various gradients, detection at 215nm] until the material was
seen to be pure by NMR. Analytical HPLC AKTA analysis; Phenomenex
C18 column; 0-50%B over 30mins at 1ml/min; eluent A = 0.1% TFA/water,
eluent B = 0.1% TFA/MeCN, detection at 650nm; rt = 22.04mins. MS (MALDI
TOF) found 1094 (M+); [theoretical (C55H80N7O12S2) 1094].

3.6 Preparation of α-D-Desthiobiotin-ε-Cy5-L-lysine-MESNA
[N6-(6-{(2Z)-2-[(2E,4E)-5-(1-ethyl-3,3-dimethyl-5-sulfo-3H-indolium-2-yl)penta-2,4-
dienylidene]-3,3-dimethyl-5-sulfo-2,3-dihydro-1H-indol-1-yl}hexanoyl)-N2-(6-
{[6-(5-methyl-2-oxoimidazolidin-4-yl)hexanoyl]amino}hexanoyl)lysylthio
propane-3-sulfonic acid]

α-D-Desthiobiotin-ε-Cy5-L-lysine-OH (10mg, 8.8µmol) was dissolved in
anhydrous dimethylsulphoxide (2ml). PyAOP (10mg, 19.2µmol), MESNA
(5mg, 0.30mmol) and DIEA (10µl, 0.06mmol) were added and the reaction
mixture was stirred under nitrogen for 4hrs. The reaction mixture was purified
by RP-HPLC [Vydac Protein and Peptide C18 column; 15-40%B over 45mins
at 10ml/min; eluent A = 0.1% TFA/water, eluent B = 0.1% TFA/MeCN,
detection at 215nm]. The product containing fractions were combined and the
majority of solvent removed under reduced pressure; the residue was freeze
dried. The product was obtained as a fluffy blue solid (4mg, 37%). TLC (RP
C18; 1:1 water:acetonitrile) rf = 0.76. Analytical HPLC AKTA analysis;
Phenomenex C18 column; 0-50%B over 30mins at 1ml/min; eluent A = 0.1%
TFA/water, eluent B = 0.1% TFA/MeCN, detection at 650nm; rt = 21.04mins.
λmax = 648nm (PBS buffer). MS (MALDI TOF) found 1219 (MH+);
[theoretical (C57H84N7O14S4) 1218].
4. Protein labelling and affinity purification

4.1 Labelling of N-terminal cysteine Grb2SH2 with
α-D-desthiobiotin-ε-Cy5-L-lysine-MESNA and purification

To N-terminal cysteine Grb2SH2 (N-Cys-Grb2SH2) (200µM in PBS
buffer pH 7.2) (200µl) was added α-D-desthiobiotin-ε-Cy5-L-lysine-MESNA
(2mM in reaction buffer) (200µl). N-Cys-Grb2SH2 was prepared using
recombinant techniques. The reaction buffer consisted of phosphate buffer
(200mM), pH 7.2 containing sodium chloride (200mM) and 4% MESNA. The
reaction mixture was left at RT for 12 hrs, wrapped in foil to protect from light.
The reaction was then quenched with dl-dithiothreitol (final concentration
60mM). Unreacted dye was separated from labelled/unlabelled protein by
FPLC, using a fast desalt column and eluent of PBS buffer, pH 7.4; 2ml/min,
detection 280 nm. Protein fractions were combined and
desthiobiotin-ε-Cy5-L-lysine affinity probe labelled protein was bound to
streptavidin beads (PIERCE Ultralink™ streptavidin). The beads were washed
vigorously with both PBS buffer and binding buffer (PBS containing SOOmM
NaCl). The product, α-D-desthiobiotin-ε-Cy5-L-Lys-Cys-Grb2SH2, was
extracted from the streptavidin beads by adding cold biotin (1.6mM). Several
extraction runs were required.
The materials were further purified by dialysis (PIERCE Slide-a-lyser™ mini
dialysis units, 7,000 mwco) to remove free biotin from the sample. The
product was analysed by SDS PAGE together with the following controls (see
Figure 1):
Lane 1        MW marker
Lane 2        Unligated protein control
Lane 3        Ligation reaction mixture: α-D-desthiobiotin-ε-Cy5-L-Lysine-MESNA
              and N-Cys-Grb2SH2
Lane 4        Labelled/unlabelled N-Cys-Grb2SH2 after FPLC purification
Lane 5        Unbound protein
Lane 6        Streptavidin bead washes
Lanes 7,8,9   Affinity purified product
Lane 10       Unreacted α-D-desthiobiotin-ε-Cy5-L-Lysine-MESNA



The gel was imaged using a Typhoon imager (Figure 1B) using parameters
for Cy5 fluorescence to detect fractions containing the fluorescent label. The
gel was then stained with Coomassie blue stain (Figure 1A) to determine the
protein containing fractions. The SDS PAGE gel shows that (a) unlabelled
protein (both factor XA and N-Cys-Grb2SH2) did not bind to the streptavidin
beads (Figure 1A and 1B, column 5) (enriched protein stain) and (b) the
product was removed from the streptavidin beads by adding cold biotin
(Figure 1A and 1B, columns 7 and 8) (both protein stain and Cy5 fluorescence).
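
For the ligation in 4.1, equal volumes of protein and probe solution are
combined, so each component is halved in concentration on mixing. The
sketch below works through that arithmetic. It assumes the volumes are
additive and reads the protein stock concentration as 200µM, which is an
interpretation of the text; the function name is illustrative.

```python
def mix(vol_a_ul, conc_a_um, vol_b_ul, conc_b_um):
    """Return final concentrations (uM) of two solutes after combining two solutions."""
    total_ul = vol_a_ul + vol_b_ul
    return conc_a_um * vol_a_ul / total_ul, conc_b_um * vol_b_ul / total_ul


# 200 ul of protein (assumed 200 uM) plus 200 ul of probe at 2 mM (2000 uM).
protein_um, probe_um = mix(200, 200, 200, 2000)
print(f"protein: {protein_um} uM, probe: {probe_um} uM")
print(f"molar excess of probe over protein: {probe_um / protein_um:.0f}-fold")
```

Under these assumptions the probe is present in molar excess over the
protein, which is why unreacted dye has to be removed by desalting before
the affinity step.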



WO 2004/011556                                        PCT/GB2003/003196

Claims 



1. A compound comprising a cyanine dye or derivative thereof containing
at least one target bonding group selected from a carboxylic acid thioester
group or a group suitable for covalent reaction with a thioester, characterised
in that said compound includes an affinity tag covalently bound thereto.

2. A compound according to claim 1 having the formula (I):

    D-L1-M-L2-B
         |
         F
                    (I)

wherein:

D is a dye selected from a cyanine dye or a derivative thereof;
B is an affinity tag;
F comprises a target bonding group selected from a carboxylic acid thioester
group and a 1,2-aminothiol group;
M is a group adapted for attaching to F; and
L1 and L2 each independently comprise a group containing from 1-40 linked
atoms selected from carbon atoms which may optionally include one or more
groups selected from -NR'-, -O-, -CH=CH-, -CO-NH- and phenylenyl
groups, where R' is selected from hydrogen and C1 - C4 alkyl.

3. A compound according to claim 2 wherein each of L1 and L2 contains
from 2 to 30 atoms.

4. A compound according to claim 2 wherein L1 and L2 are independently
selected from the group:

    -{(CHR')p-Q-(CHR')r}s-

where Q is selected from: -CHR'-, -NR'-, -O-, -CH=CH-, -Ar- and
-CO-NH-; R' is hydrogen or C1 - C4 alkyl, p is 0 - 5, r is 1 - 5 and s is 1 or 2.

5. A compound according to claim 4 wherein Q is selected from -CHR'-,
-O- and -CO-NH-, where R' is hereinbefore defined.

6. A compound according to any of claims 1 to 5 wherein said affinity tag 
is selected from biotin and desthiobiotin. 

7. A compound according to any of claims 1 to 5 wherein said affinity tag
is selected from his-tag, iminodiacetic acid and nitrilotriacetic acid.

8. A compound according to any of claims 1 to 7 wherein the target
bonding group F is a carboxylic acid thioester of formula:


wherein U is a bond or is a group containing from 1-30 linked atoms
selected from carbon atoms and optionally one or more groups selected from
-NH-, -O- and -CO-NH-; and R" is C1 - C4 alkyl, C6 - C10 aryl, or C7 - C15
aralkyl, which may be optionally substituted with sulphonate; or is the group
-(CH2)2-CONH2.

9. A compound according to any of claims 1 to 7 wherein the target
bonding group F is a 1,2-aminothiol group of formula:

[structural formulae not reproduced; surviving fragments: S-R" and "or"]

wherein U is hereinbefore defined.
10. A compound according to any of claims 1 to 9 wherein the compound
has the formula (II):

[structure of formula (II) not reproduced; surviving labels: R4, R5, R6]

(II)
wherein:

groups R3 and R4 are attached to the Z1 ring structure and groups R5 and R6
are attached to the Z2 ring structure;
n is an integer from 1 to 3;
Z1 and Z2 independently represent the atoms necessary to complete one ring
or two fused ring aromatic or heteroaromatic systems, each ring having five or
six atoms selected from carbon atoms and optionally no more than two atoms
selected from oxygen, nitrogen and sulphur;

X and Y are the same or different and are selected from: >CR8R9, oxygen,
sulphur, -CH=CH-, >N-W wherein N is nitrogen and W is selected from
hydrogen and the group R10;

at least one of groups R1, R2, R3, R4, R5, R6, R8, R9 and R10 is the group:



    -L1-M-L2-B
        |
        F

where B, F, M, L1 and L2 are hereinbefore defined;

groups R7 are independently selected from hydrogen and C1 - C4 alkyl which
may be unsubstituted or substituted with aryl, or two or more of R7 together
with the group:

[structure not reproduced]

form a hydrocarbon ring system substituted with and which may optionally
contain a heteroatom selected from -O-, -S- or >NR7, wherein R7 and n are
hereinbefore defined;

remaining groups R3, R4, R5 and R6 are independently selected from the
group consisting of hydrogen, halogen, amide, cyano, nitro, mono- or di-C1 -
C6 alkyl-substituted amino, carbonyl, carboxyl, C1 - C6 alkyl, C1 - C6 alkoxy,
aryl, heteroaryl, aralkyl and the group -(CH2)m-Y where Y is selected from
sulphonate, sulphate, phosphonate, phosphate and quaternary ammonium
and m is zero or an integer from 1 to 6;

remaining groups R8, R9 and R10 are independently C1 - C6 alkyl; and
remaining groups R1 and R2 are independently selected from hydrogen, C1 -
C10 alkyl, the group -(CH2)m-Y wherein Y and m are hereinbefore defined,
and benzyl which may be unsubstituted or substituted by up to two nitro
groups.

11. A compound according to any of claims 1 to 9 wherein the compound 
has the formula (III): 




(III) 

wherein 

groups R^^ R^®, R^* and R^® are attached to the rings containing X and Y or,
optionally, are attached to atoms of the Z1 and Z2 ring structures;
Z1 and Z2 independently represent the atoms necessary to complete one ring
or two fused ring aromatic or heteroaromatic systems, each ring having five or
six atoms selected from carbon atoms and optionally no more than two atoms
selected from oxygen, nitrogen and sulphur;
X and Y are the same or different and are selected from: >CR®R®, oxygen,
sulphur, -CH=CH-, >N-W wherein N is nitrogen and W is selected from
hydrogen and the group R^°;

A is selected from O and NR^^ where R^® is the substituted amino radical: 



at least one of groups R®, R^, R^° R^^ R^^. R^^, R^^ R^^ and R^® is the
group:

    -L1-M-L2-B
        |
        F

where B, F, M, L1 and L2 are hereinbefore defined;

remaining groups R^\ R'*^, R^^, R^"* and R^^ are independently selected from
the group consisting of hydrogen, halogen, amide, cyano, nitro, mono- or di-
C1 - C6 alkyl-substituted amino, carbonyl, carboxyl, C1 - C6 alkyl, C1 - C6
alkoxy, aryl, heteroaryl, aralkyl and the group -(CH2)m-Y where Y is selected
from sulphonate, sulphate, phosphonate, phosphate and quaternary
ammonium and m is zero or an integer from 1 to 6;
remaining groups R®, R^ and R''^ are independently C1 - C6 alkyl;
remaining group R^^ is selected from hydrogen, C1 - C4 alkyl and aryl; and
remaining group R^® is selected from C1 - C6 alkyl, aryl, heteroaryl, an acyl
radical having from 2-7 carbon atoms, and a thiocarbamoyl radical.

12. A compound according to claim 10 or claim 11 wherein Z1 and Z2 are
selected independently from the group consisting of phenyl, pyridinyl,
naphthyl, quinolinyl and indolyl moieties.

13. A compound according to claim 10 or claim 11 wherein Z1 and Z2 are
selected from phenyl and naphthyl moieties.
14. A method for labelling a protein of interest wherein said protein
contains or is derivatised to contain an N-terminal cysteine, the method
comprising:

i) adding to a liquid containing said protein a compound of formula (I):

    D-L1-M-L2-B
         |
         F
                    (I)

wherein:

D is a dye selected from a cyanine dye or a derivative thereof;
B is a bioaffinity tag;
F comprises a target bonding group selected from a carboxylic acid thioester
group and a 1,2-aminothiol group;
M is a group adapted for attaching to F; and
L1 and L2 each independently comprise a group containing from 1-40 linked
atoms selected from carbon atoms which may optionally include one or more
groups selected from -NR'-, -O-, -CH=CH-, -CO-NH- and phenylenyl
groups, where R' is selected from hydrogen and C1 - C4 alkyl; and

ii) incubating said compound with said protein under conditions suitable
for labelling said protein.

15. A compound according to claim 14 wherein each of L1 and L2 contains
from 2 to 30 atoms.



16. A method according to claim 14 wherein L1 and L2 are independently
selected from the group:

    -{(CHR')p-Q-(CHR')r}s-

where Q is selected from: -CHR'-, -NR'-, -O-, -CH=CH-, -Ar- and
-CO-NH-; R' is hydrogen or C1 - C4 alkyl, p is 0 - 5, r is 1 - 5 and s is 1 or 2.

17. A method according to claim 16 wherein Q is selected from -CHR'-,
-O- and -CO-NH-, where R' is hereinbefore defined.

18. A method according to any of claims 14 to 17 further comprising 
separating and/or purifying the dye-labelled protein of interest by affinity 
chromatography. 

19. A method according to any of claims 14 to 18 wherein said protein of
interest is selected from antibody, antigen, protein, peptide, microbial
materials, cells and cell membranes.
Figure 1/1

SDS PAGE analysis - Coomassie blue stain and Typhoon Image

Image A (labels: Factor XA; Cy5-Lys-Cys-Grb2SH2 containing fractions)

Image B (labels: Cy5-Lys-Cys-Grb2SH2 containing fractions; Free Dye)